Document Set Redundancy Compression Method Using Template Differential
نویسندگان
چکیده
Document image information systems are used more and more in government. Much redundant information in the document existed in such systems. That implies the research on the compression method based on the page-page statistical features is quite significant. Set Redundancy Compression (SRC) is such a technique that reduces the total entropy of the whole image set by utilizing the image page’s similarity. Compression-based Template Differential (CTD) is an improved SRC. The similar image set is constructed by the document template. The coding performance is improved by adding the template image into the Min-Max Differential (MMD) coding/decoding model. It proves theoretically that CTD’s coding performance is higher than MMD’s. It is demonstrated by experiments that both the CTD and MMD are benefit to increase the compression ratio of image set, however CTD increases more than MMD.
منابع مشابه
Proceedings of the International Conference on Image Processing , 1996 STRUCTURE - PRESERVING DOCUMENT IMAGE COMPRESSIONOmid
Maintaining a document in image form is often preferable in order to avoid the high cost of manual conversion or the introduction of large numbers of errors by automatic OCR and/or graphics interpretation. The large volume of data in the image can be greatly reduced by using compression techniques. Text-intensive document images typically have a great deal of redundancy in the bitmap representa...
متن کاملStructure-preserving document image compression
Maintaining a document in image form is often preferable in order to avoid the high cost of manual conversion or the introduction of large numbers of errors by automatic OCR and/or graphics interpretation. The large volume of data in the image can be greatly reduced by using compression techniques. Text-intensive document images typically have a great deal of redundancy in the bitmap representa...
متن کاملOn the Effectiveness of using Sentence Compression Models for Query-Focused Multi-Document Summarization
This paper applies sentence compression models for the task of query-focused multi-document summarization in order to investigate if sentence compression improves the overall summarization performance. Both compression and summarization are considered as global optimization problems and solved using integer linear programming (ILP). Three different models are built depending on the order in whi...
متن کاملMulti-Document Summarization By Sentence Extraction
This paper discusses a text extraction approach to multidocument summarization that builds on single-document summarization methods by using additional, available in-, formation about the document set as a whole and the relationships between the documents. Multi-document summarization differs from single in that the issues of compression, speed, redundancy and passage selection are critical in ...
متن کاملA Comparison of Set Redundancy Compression Techniques
Medical imaging applications produce large sets of similar images. Thus a compression technique is necessary to reduce space storage. Lossless compression methods are necessary in such critical applications. Set redundancy compression (SRC) methods exploit the interimage redundancy and achieve better results than individual image compression techniques when applied to sets of similar images. In...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- JSW
دوره 7 شماره
صفحات -
تاریخ انتشار 2012